Picture for Yifan Wu

Yifan Wu

OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale

Add code
Feb 05, 2026
Viaarxiv icon

Resilient Load Forecasting under Climate Change: Adaptive Conditional Neural Processes for Few-Shot Extreme Load Forecasting

Add code
Feb 04, 2026
Viaarxiv icon

Out of the Memory Barrier: A Highly Memory Efficient Training System for LLMs with Million-Token Contexts

Add code
Feb 02, 2026
Viaarxiv icon

Calibration without Ground Truth

Add code
Jan 27, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

SafeLoad: Efficient Admission Control Framework for Identifying Memory-Overloading Queries in Cloud Data Warehouses

Add code
Jan 05, 2026
Viaarxiv icon

OpenAI GPT-5 System Card

Add code
Dec 19, 2025
Viaarxiv icon

Investigating Data Pruning for Pretraining Biological Foundation Models at Scale

Add code
Dec 15, 2025
Viaarxiv icon

Coherence Mechanisms for Provable Self-Improvement

Add code
Nov 11, 2025
Viaarxiv icon

Scaling Agent Learning via Experience Synthesis

Add code
Nov 10, 2025
Figure 1 for Scaling Agent Learning via Experience Synthesis
Figure 2 for Scaling Agent Learning via Experience Synthesis
Figure 3 for Scaling Agent Learning via Experience Synthesis
Figure 4 for Scaling Agent Learning via Experience Synthesis
Viaarxiv icon